PROCEEDINGS  of  the  FOURTH 
BERKELEY  SYMPOSIUM  ON 
MATHEMATICAL  STATISTICS 
AND  PROBABILITY 


Held  at  the  Statistical  Laboratory 
University  of  California 
June  20- July  30,  1960, 

with  the  support  of 
University  of  California 
National  Science  Foundation 
Office  of  Naval  Research 
Office  of  Ordnance  Research 
Air  Force  Office  of  Research 
National  Institutes  of  Health 


VOLUME  IV 


Contributions  to  Biology  and  Problems  of  Medicine 


Edited  by  JERZY  NEYMAN 


UNIVERSITY  OF  CALIFORNIA  PRESS 

BERKELEY  AND  LOS  ANGELES 

1961 


ON  GENERAL  LAWS  AND  THE 
MEANING  OF  MEASUREMENT  IN 
PSYCHOLOGY 

G.  RASCH 

THE  DANISH  INSTITUTE  OF  EDUCATIONAL  RESEARCH,  COPENHAGEN 

1.  Introduction 

Lately  the  statistical  methods  of  psychometrics  have  been  severely  criti¬ 
cized  in  psychological  quarters.  Thus  Skinner  [1]  maintains  that  if  order  is  to 
to  be  found  in  human  and  animal  behavior,  then  it  should  be  extracted  from 
investigations  into  individuals,  and  that  psychometric  methods  are  inadequate 
for  such  purposes  since  they  deal  with  groups  of  individuals.  And  as  regards 
abnormal  psychology  Zubin  [2]  states:  “Recourse  must  be  had  to  individual 
statistics,  treating  each  patient  as  a  separate  universe.  Unfortunately,  present 
day  statistical  methods  are  entirely  group-centered,  so  that  there  is  a  real  need 
for  developing  individual-centered  statistics.” 

In  a  recently  published  book  [3]  I  have  developed  three  models  for  reactions 
to  certain  attainment  tests  and  intelligence  tests.  Within  the  very  limited  areas 
covered,  these  models  represent  an  attempt  to  meet  this  challenge.  In  fact,  each 
model  specifies  a  distribution  function  for  the  potential  responses  of  a  given  per¬ 
son  to  a  given  stimulus  of  a  certain  set  of  allied  stimuli,  and  this  distribution 
function  depends  upon  a  parameter  characterizing  the  person  and  a  parameter 
characterizing  the  stimulus.  The  models  have  a  remarkable  property  in  common 
that  renders  it  possible,  in  the  analysis  of  the  data,  to  detach  the  personal  param¬ 
eters  from  the  stimulus  parameters,  and  vice  versa.  And  furthermore,  we  may 
check  the  adequacy  of  the  model  itself  independently  of  both  sets  of  parameters. 

The  present  paper  is  concerned  with  a  rather  large  class  of  models  sharing 
this  separability  property,  and  also  with  some  of  the  implications  of  this  type  of 
models. 


2.  A  model  for  tests  in  oral  reading 

Let  me  begin  with  a  description  of  two  of  the  above  mentioned  models  which 
pertain  to  static  situations,  leaving  the  third  one,  which  is  dynamical,  for 
another  occasion. 

A  large  number  v  =  1 ,  •  •  •  ,  n,  of  children  were  given  a  few  tests,  i  =  1 ,  •  •  •  ,  k, 
in  reading  aloud  and  on  each  occasion  the  number  of  misreadings  was  counted. 
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For  the  sake  of  simplicity,  we  shall  consider  only  the  cases  where  all  texts  were 
completed. 

For  reasons  not  to  be  discussed  here  (compare  chapter  2  in  [3])  the  number  of 
misreadings  xti  made  by  child  number  v  in  text  number  i  is  assumed  to  follow 
a  Poisson  law, 

(2.1)  P{xti}  =  e~h‘  J21- 

We  shall  attempt  to  describe  the  parameter  as  the  product  of  two  factors, 
one  pertaining  to  the  person  and  one  to  the  text, 

(2.2)  *„•  =  6,<Ti. 

At  this  stage  we  might,  in  keeping  with  the  usual  psychometric  reasoning, 
assume  that  the  0,  the  “underlying  continuum  of  inabilities,”  follow  some  dis¬ 
tribution.  A  normal  distribution  is  usually  preferred,  but  since  its  composition 
with  the  Poisson  law  would  be  hard  to  handle,  a  gamma  distribution  might  be 
tolerated  for  once. 

It  is,  however,  a  main  point  of  the  present  paper  that  for  models  like  (2.1)  any 
assumption  about  populations  in  an  ordinary  psychometrical  sense  is  superfluous 
as  far  as  comparisons  of  tests  and  comparisons  of  persons  go.  In  fact,  if  we  assume 
stochastic  independence  of  the  k  test  results  for  the  same  child  and  also  of  the  test 
results  for  different  children,  the  distribution  function  for  the  whole  set  of 
observations  ((&,»•)),  with  v  =  1,  •  •  •  ,  n  and  i  =  1,  •  •  •  ,  k,  becomes 

n«-IR 

(2.3)  ?{((*„■))}  =  e->-  - 1 - 

where  a  dot  indicates  summation  over  the  corresponding  index. 

From  (2.3)  it  is  easy  algebra  to  deduce  the  distribution  of  the  grand  total, 

(2.4)  P{*-> 

and  the  two  conditional  distributions  of  the  marginal  sums,  given  the  grand 
total, 

(2.5)  p{(*..)m  =  (*,,  ,  *..)  n  (£)~ 

(2.6)  P{(*,)|*..}  (;;)". 

It  is  also  easily  realized  that  for  given  x..  the  two  sets  of  marginal  sums  are 
stochastically  independent,  that  is, 

(2.7)  P{(x,.),  (*.<)!*..}  =  P{(z,.)|^.}P{(x.i)|:r..}. 

Finally,  (2.3)  with  (2.5)  and  (2.6)  leads  to 
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JX rr*'* !  IX x 

(2.8)  P{(M)\M,  (*•<)}  =  --TffYf  V 

*C-*  XXXI x* 

v  i 

Formulas  (2.5)  and  (2.6)  give  a  quite  clear  cut  separation  of  the  parameters: 
the  ratios  0„/0.  are  to  he  estimated  from  one  marginal ,  the  ratios  o-,/<r.  from  the  other 
one,  and  these  estimators  are  sufficient.  With  one  of  the  tests  chosen  as  reference, 
with  <r  —  1,  all  the  other  a  can  be  estimated  uniquely.  In  consequence,  0.  may 
be  estimated  from  (2.4),  and  accordingly,  the  estimates  of  the  0,  are  fixed. 

Formula  (2.8)  tells  us  that,  given  the  two  marginals,  the  total  set  of  observa¬ 
tions  obeys  a  distribution  which  is  independent  of  all  parameters.  In  consequence, 
we  may  check  the  model  without  involving  any  of  the  unknown  parameters.  In  order 
to  indicate  in  which  way  (2.8)  may  be  employed  for  such  control  purposes,  we 
consider  a  special  case  of  (2.6),  namely, 

(2-9)  P{xv\,  •  ,  xvk\xv)  =  (^Xh  ...  ,  x^j  JJ  ^ 

from  which  it  follows  that  the  expected  value  of  any  xVi  for  given  total  xv.  is 
proportional  to  xv., 

(2.10)  E[Xyi\Xy.]  =~  Xy.. 

<x . 

Thus,  if  we  collect  children  into  groups  according  to  their  total  number  of  mis¬ 
readings  in  the  k  tests  all  together  and,  for  each  group,  average  the  number  of 
misreadings  in  test  number  i,  these  averages  should,  apart  from  random  varia¬ 
tions  which  may  be  judged  by  a  x2-test,  increase  proportionally  to  xv.. 

This  test,  which  is,  by  the  way,  passed  beautifully  by  my  observations,  may 
be  conceived  as  an  application  of  (2.8)  in  which  we,  without  specifying  explicitly 
any  alternative  to  our  model  (2.1)  and  (2.2),  are  looking  for  trouble  in.  a  more 
or  less  definite  direction,  namely,  for  the  possibility  that  the  relative  difficulties 
of  the  tests  may  vary  with  xv.,  that  is,  with  the  reading  inability  of  the  children. 


3.  A  model  for  a  type  of  intelligence  test 


For  the  responses  to  the  items  of  an  intelligence  test  the  following  model 
is  suggested.  (For  the  background  of  this  choice,  see  chapter  5  in  [3].)  The 
probability  that  person  number  v  gives  a  correct  answer  to  item  number  i  is 


(3.1) 


OyOi 

1  -J-  0,<Ji 


assumed  independent  of  the  answers  to  the  preceding  questions. 

For  the  following  algebra  it  is  convenient  to  reformulate  this  model  in  terms 
of  another  random  variable  a,„  defined  as  1  if  the  answer  of  person  v  to  item  i 
is  correct,  0  if  it  is  not  correct.  In  both  cases  we  have 
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(3.2) 


P  {«»»} 


1  +  dv<Ji 


Under  the  assumption  of  independence  between  the  answers  to  different  items 
for  any  given  person,  as  well  as  between  the  answers  of  different  persons,  the 
probability  of  the  whole  set  of  answers 

(3.3)  A  (a^t),  v  =  1,  •  •  •  ,  n |  i  =  1,  *••,/?, 

of  the  n  persons  to  the  k  items  becomes 


(3.4) 

where  the  denominator  is 


P{A }  = 


n  n *• 

_ v _ i _ 

7(e,s) 


(3.5)  T(e,  s)  =  n  n  a  +  «.<’.)• 

v  i 

Since  P{A}  is  the  same  for  all  matrices  (3.3)  with  the  same  two  marginal 
vectors 


(3.6)  a*.  =  (oi.,  •  •  •  ,  an.),  a.*  =  (a. i,  •  •  •  ,  a.k), 

the  joint  distribution  of  these  vectors  is 


(3.7) 


<u 


n  n 

v _ i _ 

7(0,  2) 


where  the  coefficient  on  the  right  denotes  the  number  of  all  such  possible  matrices 
with  all  a„i  =  1  or  0.  On  summing  (3.7)  over  all  <2*.,  we  obtain  the  distribution 


(3.8) 
where 

(3.9) 


p<a->  -  SS  n 


Now,  by  dividing  (3.8)  into  (3.7)  we  obtain  the  conditional  distribution  of  one 
marginal  vector,  given  the  other, 


(3.10) 


n » 


To,.(2) 

and  a  symmetrical  argument  yields  the  symmetrical  formula 

n«- 


(3.11) 


*J  78^(0) 

Finally,  we  may  divide  (3.7)  into  (3.4),  thus  obtaining  the  conditional  dis- 
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tribution  of  the  total  matrix  of  observations,  given  the  two  marginal  vectors, 


(3.12) 


P{A\a, ,,  <U 


Formulas  (3.10),  (3.11),  and  (3.12)  form  the  analogues  to  (2.5),  (2.6),  and  (2.8), 
being,  however,  somewhat  weaker  since  a*,  and  a.*  for  given  a.,  do  not  appear  to 
be  stochastically  independent.  The  main  feature,  however,  has  been  retained. 

On  the  basis  of  (3.10)  we  may  estimate  the  item  parameters  independently  of  the 
personal  parameters ,  the  latter  having  been  replaced  by  something  observable,  namely, 
by  the  individual  total  number  of  correct  answers.  Furthermore,  on  the  basis  of 

(3.11)  we  may  estimate  the  personal  parameters  without  knowing  the  item  parameters 
which  have  been  replaced  by  the  total  number  of  correct  answers  per  item.  Finally, 

(3.12)  allows  for  checks  on  the  model  (3.2)  which  are  independent  of  all  of  the  param¬ 
eters,  relying  only  upon  the  observations. 


4.  Models  with  separate  parameters 

The  practical  results  of  applying  these  two  models  have  been  encouraging 
enough  to  justify  a  theoretical  study  of  their  basic  properties  as  a  preliminary 
to  attempts  at  covering  much  wider  fields  of  psychology.  Formally,  at  least,  we 
shall  confine  ourselves  to  the  types  of  experiments  or  observations  where  all  per¬ 
sons  in  question  have  been  exposed  to  the  same  set  of  stimuli  belonging  to  a 
certain,  potentially  large,  class  of  allied  stimuli.  And  at  present  we  shall  deal 
only  with  single  individuals  in  static  situations,  assuming  that  preceding  re¬ 
sponses  of  a  given  person  have  no  influence  upon  later  responses,  and  the  re¬ 
sponses  of  any  given  person  are  unaffected  by  the  responses  of  any  other  person. 
Situations  to  which  these  conditions  do  not  apply  may  be  considered  under  a 
dynamic  point  of  view,  such  as  processes  of  learning  or  of  adaptation,  or  as 
group  psychological  phenomena. 

In  passing  it  may  be  noted  that  this  assumption  of  independence  of  items  is 
conditional  to  a  given  “ability.”  It  is,  therefore,  not  at  variance  with  the  well- 
established  high  intercorrelations  between  items  which  are  produced  under  our 
model  by  the  strong  variation  of  the  “abilities”  in  the  “population”  considered. 

The  responses  may,  of  course,  be  quantitative,  such  as  reaction  times,  or 
estimated  sizes  or  velocities.  However,  they  may  just  as  well  be  qualitative, 
such  as  answers  to  a  questionnaire  or  solutions  to  problems  in  an  intelligence 
test,  which  are  recorded  as,  for  example,  correct,  incorrect,  or  not  answered. 

In  psychometric  practice  the  specification  of  the  statistical  model  for  the 
latter  type  of  data  is  usually  preceded  by  a  so-called  quantification  or  scoring 
of  the  qualitative  observations,  which  is  quite  often  nothing  more  than  an  enu¬ 
meration  of  possible  responses.  In  view  of  Dirichlet’s  definition  of  a  function  as 
just  a  correspondence  between  two  sets  of  elements,  such  procedures  would  seem 
unnecessary  as  prerequisites  to  mathematical  formulations  of  general  laws.  In 
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the  present  approach  we  shall,  therefore,  start  from  the  original  observations 
themselves,  quantitative  or  qualitative  as  the  case  may  be. 

In  accordance  with  the  two  models  already  considered  we  shall  attempt  to 
characterize  each  individual  v  by  a  parameter  8 ,  which,  however,  is  not  neces¬ 
sarily  one-dimensional.  And  similarly,  each  stimulus  i  will  be  characterized  by  a 
parameter  <r»  which  also  may  be  of  a  higher  dimension.  To  each  combination  of 
individual  and  stimulus  we  shall  assign  a  probability  distribution  on  the  set  of 
possible  responses.  For  simplicity  we  shall  on  this  occasion  consider  only  the 
case  of  a  finite  number  m  of  possible  responses,  to  be  denoted  x(1),  •  •  •  ,xim).  The 
probability  that  individual  v  gives  the  response  x,i  =  x,  which,  by  the  way,  also 
may  be  a  vector,  to  the  stimulus  i  is  assumed  to  be  determined  by  x  and  a  param¬ 
eter  which  is,  in  turn,  determined  entirely  by  8V  and  tr,-, 

(4.1)  £„,•  =  n(dy,  <TX). 

This  formulation  allows  for  the  possibility  that  the  characteristics  of  persons 
and  stimuli  show  some  sort  of  random  variation.  In  this  case  9,  and  are  inter¬ 
preted  as  parameters  in  probability  distributions. 

In  section  5  we  shall  refer  to  this  representation;  at  present  it  suffices  to  keep 
in  mind  that  the  probability  that  xvi  =  x  is  conditioned  by  9V  and  o-,-. 

In  the  proofs  of  the  separability  of  the  parameters  in  sections  2  and  3  the 
relative  sufficiency  played,  to  say  the  least,  an  important  instrumental  role.  Fol¬ 
lowing  this  lead  we  may  ask  what  conditions  are  required  for  the  existence  of 
sufficient  statistics  for  the  9 ,  provided  the  <r  are  known,  and  vice  versa.  From 
well-known  theorems  on  sufficiency  it  follows  that,  apart  from  a  normalizing 
function  depending  on  6,  and  <r,  only,  log  P{x\8„  <ri}  must  be  a  bilinear  function 
of  some  transformations  9,  and  <r<  of  9,  and  o-t,  respectively.  Therefore,  on  proper 
choice  of  8V  and  <r,  we  have 

(4.2)  P{x\9v,  <r,}  =  —7~ — r  exp  [<p(x)9„  +  ^(s)cr»  +  x(x)0yVi  +  pG*0]» 

where  <p,  x,  p  are  functions  of  x  only.  In  writing  (4.2),  it  is  presumed  that  the 
parameters  are  one-dimensional,  but,  in  order  to  make  the  formula  apply  to 
higher  dimensional  cases,  we  just  have  to  interpret  <p(x)8 „  and  z)<r,-  as  inner 
products  of  vectors,  while  x(z)  is  taken  to  be  a  matrix  and  xW®^;  to  be  a 
homogeneous  bilinear  form  in  the  elements  of  9,  and  crx.  For  the  sake  of  simpli¬ 
city,  most  of  the  formulas  in  the  following  will  be  expressed  as  if  9,  and  a,  were 
scalars,  but  the  above  reinterpretation  is  available  at  any  time. 

I  have  not  yet  studied  the  general  form  (4.2)  in  detail,  but  the  case  x(x)  —  0, 
for  which  there  exist  relatively  sufficient  estimators  for  the  8  which  are  independ¬ 
ent  of  the  a  and  vice  versa,  shows  several  features  of  considerable  interest. 

Preparatory  to  the  analysis  of  this  case  I  shall  change  the  stochastic  variable 
by  introducing  the  selection  vector 

(4.3)  ayi  =  (0,  •  •  •  ,  1,  •  •  •  ,  0)  =  (a8\  •  •  •  ,  a<f), 

which  has  elements  0  except  the  Mth  where  x,t  =  x(m).  Writing,  furthermore, 
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(4.4)  h  =  ^(xw),  pM  =  p(*w) 
and  introducing  the  vectors 

(4.5)  4>  =  (<pi,  •  •  •  ,  <pw),  *  =  (^i,  •  •  •  ,  &»),  P  =  (pi,  •  •  •  ,  pm), 

we  have 

(4.6)  P{ari}  =  z-1  -  exp  (0,G„t4>*  +  *,«,***  +  CUP*). 

y\Pv,  <n) 

Now  imagine  that  all  of  the  k  stimuli,  i  =  1,  •  •  •  ,  k  have  been  applied  to  each 
of  the  n  persons  v  =  1,  •  •  •  ,  n  and  that  we  obtain  a  selection  vector  for  a  response 
every  time.  We  may  then  build  up  a  matrix  of  order  (n,  k),  whose  elements  are 
the  selection  vectors  For  each  person  the  Gyi  add  to  a  total  vector  tt,.,  and 
similarly,  the  G„»  for  a  given  stimulus  add  to  a  total  vector  ft.,.  Out  of  these 
marginal  vectors,  whose  grand  total  is  Ct. we  form  the  ordinary  matrices 


(4.7)  A*. 

of  orders  (n,  m)  and  (k,  m). 
Noticing  that 

(4.8) 

and  writing 


A,= 


£  £  e,ayi  =  ©A,., 

v  i 

£  £  =  s A., 

v  i 


(4.9) 


7(e,  2)  =  nn  •»(«.,  o-,), 


we  obtain  for  the  distribution  of  the  whole  set  of  6Lvi 


(4.10) 


p/4\  _  exp  (eA„.3>*  +  ZA.aA'*  +  a..P*) 
1  1  "  7(9,  S) 


Since  P{A }  remains  the  same  for  all  matrices  ((£„,■)  with  the  same  marginal 
matrices  (4.7),  the  joint  distribution  of  A*,  and  A.*  becomes 


(4.11) 


Pi  A  A  \  =  exp  (0A^,4>*  +  ZA^  +  a..P*) 

1  LA  J  7(6,  S) 


where  the  coefficient  on  the  right  denotes  the  number  of  such  matrices  for  which 
the  elements  are  selection  vectors.  By  convention  the  coefficient  is  understood 
to  be  0  if  the  two  matrices  could  not  possibly  represent  the  two  sets  of  marginal 
vectors  of  the  same  A-matrix,  for  example,  if  some  av.  >  k  or  some  a.»  >  a, 
or  if  £„Ar.  5^  £*A». 

On  summation  of  (4.11)  over  all  matrices  A*,  and  A.*  with  the  same  grand 
total  vector  G..,  we  obtain  the  distribution 


P{Q..} 


7a..(Q?  Z)  a„p* 

,7(0,  2) 


(4.12) 
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(4.13)  7a..(0,  S)  =  £  E  [^*+]exP  (e^*•<^,*  +  2A.„¥*). 

From  (4.11)  and  (4.12)  we  derive  the  conditional  distribution  of  A *.  and  A.* 
for  given  <2.., 


(4.14) 


p{A+.,  AJa.. 


exp  (eA«.<S>*  +  2  A..**) 
7«.(e,  2) 


in  which  the  terms  containing  P  have  disappeared.  Next  we  may  sum  (4.14) 
over  the  matrices  A.„  to  find  the  distribution  of  A*,  for  given  <2. 


(4.15) 
where 

(4.16) 


X»„(Z) 


-e[£] 


Now  divide  (4.15)  into  (4.14)  to  obtain  the  conditional  distribution  of  one 
marginal  matrix,  given  the  other, 

tA  ~\  P2A  jr* 

A  J  yA"(Z) 

This  distribution  is  seen  to  be  independent  of  0.  This  means  that  we  may  esti¬ 
mate  the  stimulus  parameters  without  regard  to  the  personal  parameters,  instead  of 
which  we  use  the  observed  personal  marginals. 

Symmetrically,  of  course,  we  have 


yA  (0} 

(4.18)  P{A..la..}  =  7o  s*'1"**, 
where 

(4.19)  ™..(e)  = 


and  consequently 

(4.20)  =[!*.]  Si) 

so  that  the  personal  parameters  may  be  estimated  without  regard  to  the  stimulus 
parameters,  which  are  replaced  by  the  observed  stimulus  marginals. 

Finally,  from  (4.10)  and  (4.11)  we  obtain  the  conditional  distribution  of  the 
entire  observed  set  of  selection  vectors  for  given  marginal  matrices, 


P{A|A*.,  A.*} 


1 


(4-21) 
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Since  this  distribution  is  independent  of  both  sets  of  parameters,  it  may  serve  as 
a  basis  for  nonparametric  checks  of  our  model  (4.2).  It  should  be  noticed  that  all 
algebraically  possible  ^-matrices  are  equally  probable  if  the  model  holds.  For 
an  effective  check,  the  Ctvl  should  be  collected  into  groups  pointing  to  possible 
deviations  from  the  model  in  a  more  or  less  specified  direction. 

So  far,  our  results  run  parallel  to  those  obtained  for  the  item  analysis  model 
(3.2),  but  when  we  try  to  apply  (4.17)  and  (4.20)  for  estimation  purposes,  prob¬ 
lems  of  a  new  kind  arise.  Let  us  try  for  a  moment  to  think  of  (4.2)  as  fully 
specified,  for  example,  by  the  Foisson  law  (2.1)  and  (2.2),  with  the  functions 
<p(x),  yp(x),  and  p(x)  assumed  known,  and  x(z)  =  0.  From  (4.17)  it  follows  that 
the  vector 


(4.22)  g*  =  A.^* 

is  a  sufficient  statistic  for  2  with  distribution 


(4.23) 


pm*} 


gSS* 


say,  and  the  corresponding  conditional  distribution 


(4.24) 


P{AJA„  §} 


is  independent  of  2. 

At  this  stage  two  comments  are  called  for.  In  accordance  with  ordinary 
psychometric  practice,  the  possible  observations  xa),  •  •  •  ,  x(m)  might  have  been 
translated  to  a  set  of  scores,  z(1),  •  •  •  ,  z(m),  say.  Provided,  however,  that  our 
model  is  given  by  (4.2),  or  equivalently  by  (4.6),  the  values  ^i,  •  •  •  ,  ypm  will  be 
unaffected  by  the  choice  of  the  z.  Since 


(4.25) 


Si  =  L  a-<tVn> 

M 


i  =  1,  •  •  •  ,  k 


are  sufficient  for  estimating  the  <r,  the  fa  represent  the  only  way  in  which  the  x 
should  enter  the  estimation  of  the  a.  We  may,  therefore,  ignore  the  z  and  consider 
the  \f/  as  the  proper  scoring  function  of  x  so  far  as  the  estimation  of  the  stimulus 
parameters  is  concerned. 

Symmetrically,  of  course,  <p(x)  is  the  proper  scoring  function  for  the  estimation  of 
the  personal  parameters.  I  shall  discuss  the  particular  role  of  p(x)  on  some  other 
occasion. 

The  other  comment  refers  to  (4.24).  Due  to  the  independence  of  2,  this  rela¬ 
tion  might  be  employed  just  as  well  as  (4.21)  in  a  check  of  the  model.  There  is, 
however,  an  essential  difference  between  the  kinds  of  check  exerted  by  the  two 
formulas.  Relation  (4.21)  is  independent  both  of  parameters  and  scoring  func¬ 
tions  and,  in  fact,  even  of  the  dimensions  of  the  6  and  the  <r.  Thus,  checks  based 
on  (4.21)  refer  only  to  the  pure  framework;  can  the  given  data  be  represented  at  all 
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by  any  model  whatsoever  of  the  form  (4.2)  with  x(z)  =  0.  On  the  other  hand, 
(4.24)  depends  on  the  choice  of  the  function  and  therefore,  if  the  given 
data  have  passed  the  first  check,  the  check  based  on  (4.24)  would  be  interpreted 
mainly  as  a  check  on  the  suitability  of  the  choice  of  the  scoring  function  rf/(x). 

Now  it  may  very  well  happen  that  the  framework  according  to  the  first  test 
seems  quite  feasible  but  that  the  scoring  function  according  to  the  second  test 
seems  quite  inadequate.  In  general,  therefore,  we  may  be  better  off  if  we  leave  the 
scoring  function  unspecified  altogether,  and  try  to  estimate  it  from  the  data. 

Accordingly,  our  problem  now  is  to  estimate  the  m  +  k  elements  of  2  and 
from  an  observed  matrix  A of  order  (m,  k).  This,  of  course,  is  quite  possible 
insofar  as  (4.17)  gives  an  adequate  representation  of  these  data.  Estimation  and 
checking  may  be  combined  as  in  the  following  procedure. 

The  exponent  in  (4.17)  is  a  bilinear  form  in  the  \f/  and  the  a, 

(4.26)  2 A.J?*  =  H 

i  m 

Consider  now  for  a  moment 

(4.27)  t<(m)  =  *,<rt 

as  mk  algebraically  almost  independent  (apart  from  a  trivial  normalization) 
parameters  to  be  estimated  from  the  observed  numbers  a{.f .  Denote  the  estimate 
of  t,(m)  by  t(f\ 

(4.28)  =  T,°°. 

If  a,  and  therefore  also  Vv  are  one-dimensional,  $M)  is  an  estimate  of  the  product 
^yrt;  the  joint  distribution  of  these  estimates  is  known  in  principle.  As  a  con¬ 
sequence,  the  4M)  for  any  fixed  i  should  be  “stochastically  proportional”  to  the 
Z(.M) -values  and,  symmetrically,  the  £jM)  for  any  fixed  p  should  be  “stochastically 
proportional”  to  the  i)0 -values. 

In  case  our  data  satisfy  these  conditions,  (4.17)  would  seem  applicable  with 
the  c x  and  the  interpreted  as  scalars  which  may  be  estimated  from  the  and 
the  &K  Better  estimates  may  be  available  but  this  is  a  technical  matter  which 
I  shall  leave  aside  on  this  occasion. 

However,  it  may  equally  well  happen  that  the  proportionality  condition 
clearly  fails  to  hold.  Then  the  conclusion  is  that  the  assumption  now  at  stake, 
namely,  the  one-dimensionality  of  the  a  and  the  \p,  has  to  be  dropped.  If  so,  the 
alternative  assumption  obviously  is  that  the  parameters  and  the  scoring  function 
may  be  of  a  higher  dimension.  For  an  investigation  of  this  possibility  we  turn  to 
the  reinterpretation  of  our  formulas.  Accordingly  putting 

(4.29)  <r,  =  (an,  •  •  •  ,  an),  =  (^„i,  •  •  •  , 

we  have  to  interpret  r,(M)  in  (4.27)  as  the  inner  product  of  a,  and  Since 
still  estimates  rl^,  we  now  have 

=  H  0-iA^vx  +  WtM), 

X  =  1 


(4.30) 
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where  the  residuals  u1?  under  favorable  conditions  may  be  fairly  small  and  even 
approximately  normally  correlated. 

Formally,  the  system  of  relations  (4.30)  reminds  us  strongly  of  factor  analysis 
specification,  from  which,  however,  it  differs  in  a  main  respect  in  that  the  joint  distri¬ 
bution  of  the  /<(M)  is  in  principle  known  and  depends  only  on  the  basic  parameters 
(4.29).  In  particular,  the  variations  of  the  residuals  are  not  to  be  accounted  for 
by  a  particular  set  of  variability  parameters,  and  assumptions  about  variances 
and  covariances  of  the  residuals  that  are  not  corollaries  from  the  distribution 
of  the  /,(M)  would  be  inadmissible. 

Another  consequence  of  the  distribution  of  the  4M)  is  that  the  obvious  analogue 
of  an  old  and  troublesome  question  in  factor  analysis,  “When  to  stop  factoring?”, 
here  has  a  fairly  definite  answer  insofar  as  it  would  seem  pointless  to  try  to  add 
another  pair  of  elements  to  (4.29)  when  the  calculated  residuals  after  l  pairs 
already  tally  with  the  distribution  of  the  as  computed  from  the  /-dimen¬ 
sional  estimates  of  the  <x  and  the 

On  following  the  same  line  of  deductions  from  (4.20),  we  eventually  arrive  at 
a  complete  mastery  of  the  estimation  of  both  sets  of  parameters  and  of  both 
scoring  functions,  including  the  dimensionalities.  With  this,  finally  we  may  esti¬ 
mate  the  third  scoring  function  p(x )  from  ft.,  on  the  basis  of  (4.12).  In  a  certain 
sense,  p(x )  gives  the  final  specification  of  the  model  (4.6),  the  exponential  frame¬ 
work  having  been  agreed  upon. 

5.  Principles  of  comparison 

In  psychology  proper,  and  in  particular  in  its  applications,  there  seems  to 
be  a  strong  need  for  replacing  the  original  qualitative  observations  by  measur¬ 
able  quantities.  In  formulating  a  general  law  of  the  type  (4.2),  we  have  so  far 
replaced  the  observations  by  quantitative  parameters,  but  that  does  not  imply 
that  we  have  a  proper  measurement,  on  a  ratio  scale  or  on  an  interval  scale,  of 
the  individuals  or  of  the  stimuli  nor  even  that  a  proper  ordering  is  available.  This 
is  obvious  in  case  the  parameters  are  nonscalar  since  the  relations  <  and  >  seem 
hard  to  extend  to  higher  dimensions.  But  even  in  case  of  scalar  parameters,  as  we 
shall  see,  trouble  may  arise  when  m  exceeds  2. 

In  an  attempt  to  make  clear  what  may  be  achieved  in  these  respects,  within 
the  class  of  models  considered  here,  I  shall  begin  with  simply  dropping  both  meas¬ 
uring  and  ordering  as  possibly  too  ambitious  concepts.  More  modestly  I  shall  in¬ 
quire  into  the  possibilities  for  just  comparing  individuals  and  comparing  stimuli. 
In  doing  so,  I  shall,  however,  formulate  four  requirements  that  to  my  mind  seem 
indispensable  for  well-defined  comparisons.  Preliminarily,  it  may  be  noted  that 
in  order  to  compare  stimuli  we  have  to  apply  them  to  some  adequately  chosen 
individuals,  and  similarly,  that  in  order  to  compare  individuals  in  a  given  respect, 
we  must  use  some  adequate  stimuli. 

Now  the  requirements  are  as  follows. 

The  comparison  between  two  stimuli  should  be  independent  of  which  particular 
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individuals  were  instrumental  for  the  comparison;  and  it  should  also  he  independent 
of  which  other  stimuli  within  the  considered  class  were  or  might  also  have  been 
compared. 

Symmetrically ,  a  comparison  between  two  individuals  should  be  independent  of 
which  particular  stimuli  within  the  class  considered  were  instrumental  for  the  com¬ 
parison;  and  it  should  also  be  independent  of  which  other  individuals  were  also 
compared,  on  the  same  or  on  some  other  occasion. 

Returning  now  to  (4.1)  we  shall  obviously  be  in  a  favorable  position  if  this 
equation,  whatever  £  may  be,  has  a  unique  solution  with  regard  to  0  for  any 
given  o,  and  vice  versa,  that  is,  if  the  functions 

(5.1)  0  =  \(o,  £)  and  o  =  k(0,  £) 

are  uniquely  defined. 

In  this  situation  we  may  compare  two  individuals  by  means  of  the  following 
principle  of  equivalent  stimuli:  To  any  chosen  £,  which,  it  will  be  recalled,  is  not 
a  response  but  a  distribution  parameter,  we  may  for  two  individuals  with  the 
parameters  0  and  0 '  find  the  stimuli  o  and  o'  that  produce  the  reaction  £.  Here  o 
and  a'  are  called  equivalent  stimuli  for  the  comparison  of  0  and  0'.  Now  letting 
£  vary  throughout  its  entire  range,  we  obtain  a  series  of  equivalent  pairs  of  stimuli, 
the  relationship  between  which  constitutes  a  well-defined  comparison  between  0  and  0'. 
In  fact,  for  any  stimulus  o  as  applied  to  the  individual  0  we  may  look  up  which 
stimulus  o'  corresponds  to  it  in  the  individual  0'. 

This  comparison  fulfills  the  fourth  of  our  requirements.  In  fact,  considering 
a  third  individual  with  the  parameter  0"  it  is  easy  to  see  that  if  the  first  compari¬ 
son  is  followed  by  a  comparison  of  6"  to  0'  we  shall  get  the  same  a"  as  would  have 
obtained  by  direct  comparison  of  6"  to  0. 

Similarly,  we  may  compare  two  stimuli  by  a  principle  of  equivalent  individuals : 
To  any  chosen  £  we  may  for  the  two  stimuli  considered,  with  parameters  <r  and 
o'  find  the  two  individuals,  with  parameters  0  and  0',  which  react  with  £  upon 
o  and  o'.  The  relationship  between  all  such  equivalent  individuals  constitutes  a 
well-defined  comparison  between  the  stimuli  considered.  Obviously  such  comparisons 
are  also  transitive,  thus  fulfilling  the  second  of  our  requirements. 

On  the  other  hand,  if  equation  (4.1)  for  some  0  or  for  some  o  has  more  than 
one  solution,  then  the  transitivity  cannot  hold  without  exception.  And  if  the 
equation  for  some  0  or  for  some  o  has  no  solution  corresponding  to  a  given  £, 
then  a  comparison  cannot  always  be  carried  out. 

Thus,  the  existence  and  the  uniqueness  of  the  functions  (5.1)  are  necessary  and 
sufficient  conditions  for  unrestricted  and  transitive  comparability  of  both  individuals 
and  stimuli. 

The  rule  of  transitivity  seems  to  generalize  one  of  the  most  fundamental 
properties  of  measurement.  If,  for  instance,  we  wish  to  measure  the  distance 
between  two  points  A  and  C  on  a  straight  line  we  may  do  it  directly  or  we  may 
interpose  a  third  point  B,  measure  the  distance  AB,  and  on  top  of  that  measure 
the  distance  BC  to  obtain  the  total  AC. 
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6.  On  measurement 

.Asa  corollary  we  may  conclude  that  in  the  case  of  transitivity  the  dimensions 
of  £,  0,  and  <r  must  be  equal. 

For  the  model  (4.2)  this  implies  that  \j/(x)  must  be  a  linear  transform  of  <p{x), 
which  is  always  so  in  the  trivial  case  l  ^  m  but  has  to  be  tested  in  the  case  l  <  m. 
Equivalently,  this  implies  that  with  an  appropriate  normalization  of  the  a  we 
have 

(6.1)  i(x)  s  <p(x), 

in  which  case  the  model  simplifies  to 

(6.2)  P{x |W  =  exp  BVW+  pW], 

y{vV)  (Ti) 

where 

(6.3)  i;vi  =  0y  +  a,. 

According  to  this  formula,  which  corresponds  to  the  logarithmic  version  of 

(2.2) ,  we  may  introduce  what  may  be  called  the  l-dimensional  measurement  of  the 
'personal  and  the  stimulus  parameters,  choosing,  for  example,  a  certain  stimulus, 
i  =  0  say,  as  our  reference  point  with 

(6.4)  a„  =  (0,  ...  ,0). 

With  such  a  choice  all  other  parameters  0„  and  <r,  are  uniquely  determined  by 
virtue  of  the  empirically  established  law  (6.3).  The  parallel  between  this  type  of 
approach  and  the  introduction  of  mass  and  force  as  measurable  concepts  in 
classical  dynamics  has  been  thoroughly  discussed  in  chapter  7  of  [3]. 
Accordingly,  the  first  and  the  third  requirements  also  are  fulfilled  in  the  model 

(6.2) . 

A  discussion  of  these  requirements  as  separated  from  (4.2)  awaits  another 
occasion. 
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